
//////////////////////////////////////////
//////////////////////////////////////////
The Self-Employment Effects of the EITC in the Gig Economy

by Riley Wilson

National Tax Journal

README
//////////////////////////////////////////
//////////////////////////////////////////

This folder contains all of the code used to create our analysis datasets.

This folder also contains the code necessary to perform all of the analysis discussed in the main paper.

We provide a description here of all of the do files. First, the do files for data cleaning followed
by the do files for analysis.

*****DO FILES*****
**Data Prep Do Files**
(1) acs_data_cleaning.do		-- Uses the 2005-2019 ACS downloaded from IPUMS and creates household level measures of self-employment. It combines this with information on Uber entry and the EITC to construct all necessary measures.


**Analysis Do Files**
(1) master.do            		-- This do file runs all of the figures and tables in the main manuscript.
(2) figure1.do            	 	-- Creates Figure 1 from the main manuscript.
(3) figure2.do            	 	-- Creates Figure 2 from the main manuscript.
(4) figure3.do            	 	-- Creates Figure 3 from the main manuscript.
(5) table1.do            	 	-- Creates Table 1 from the main manuscript.
(6) table2.do            	 	-- Creates Table 2 from the main manuscript.
(7) table3.do            	 	-- Creates Table 3 from the main manuscript.
(8) table4.do            	 	-- Creates Table 4 from the main manuscript.

*****DATASETS*****	
We have also included the necessary datasets to reproduce all of the analysis in the manuscript. We decribe each of these datasets here:

(1) uber_entry_dates.dta
	-This data set contains the Uber entry date at the MSA level. 
	The data is compiled by using the data available in Hall et al. (2018) and
	as hand collected from the Uber newsroom website.
	
(2) state_eitc_panel.dta
	-This data set contains the state EITC percent of the federal by year for all states.
	This data was collected by the author from the NBER.
	
(3) hslesshousehold_ubereitc2005_2019.dta
	-This data set is the main analysis data set, containing self-employment outcomes 
	at the household level from 2005-2019 using the ACS data. This data was 
	downloaded from IPUMS and cleaned accordingly. It has been merged to the Uber
	data as well as the EITC data to be able to measure EITC and Uber exposure.
	
(4) householdall_ubereitc2005_2019.dta
	-This data set is the same as the dataset above but does not restrict the 
	sample on education or marital status. It also limits the variables to only 
	those needed for the heterogeneity table. This data was downloaded from 
	IPUMS and cleaned accordingly.
